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Abstract 

Obtaining  information  about  the  surface  material  composition  of  man-made  objects  such  as  earth  orbiting  satellites  is  an  im¬ 
portant  goal  of  space  object  identification  (SOI).  Often  initial  information  about  the  possible  material  composition  of  such  targets 
is  known  a  priori ,  at  least  in  a  statistical  sense,  in  the  form  of  a  material  database  which  includes  the  measured  reflectance  curves  of 
materials  most  often  used  in  construction  of  satellites.  In  fact,  such  a  database  serves  as  an  appropriate  sparse  basis  for  modeling  the 
target  using  a  coded  spectral  imaging  system.  A  coded  imaging  system  exploits  the  fact  that  images  of  man-made  objects  posses 
correlations  in  their  physical  structure.  These  correlations,  which  can  occur  spatially  as  well  as  spectrally,  can  suggest  a  more 
natural  sparse  basis  for  compressing  and  representing  the  scene  when  compared  to  standard  pixels  or  voxels.  A  coded  imaging 
system  attempts  to  acquire  and  encode  the  scene  in  this  sparse-basis,  while  preserving  all  relevant  information  in  the  scene.  Due  to 
strong  a  priori  statistical  information  in  the  form  of  the  material  database  and  knowledge  of  their  standard  morphology,  statistical 
information  provides  a  natural  theoretical  framework  for  assessing  the  content,  acquisition,  and  processing  of  information  by  a 
coded  imaging  system  about  man-made  space  objects.  Here,  we  will  demonstrate  the  use  of  a  statistical  model  of  a  coded  imaging 

system  for  which  we  compute  statistical  information  in  the  coded  measurements  using  a  highly  efficient  Monte-Carlo  based  algo¬ 
rithm  [1],  Specifically,  we  will  study  the  problem  of  encoding  spatial  information  about  the  scene  at  different  wavelengths  into  the 
measurements.  An  important  parameter  that  will  concern  us  is  the  minimum  number  of  spectral  measurements  required  to  unam¬ 
biguously  identify  each  material  in  the  scene,  given  some  level  of  noise  in  the  data.  Due  to  our  information-based  approach,  we 
have  the  ability  to  study  the  effects  of  gradually  introducing  more  prior  information  via  the  material  database  on  the  identification 
of  the  materials  from  the  measurements.  We  expect  interdependence  between  the  choice  of  spatial  measurement  vectors  and  the 
number  of  spectral  channels.  We  anticipate  that  our  information-theoretic  results  will  yield  insight  into  how  to  optimally  choose 
the  measurement  vectors.  Such  knowledge  will  be  crucial  when  designing  a  system  to  identify  materials  with  similar  reflectance 
curves. 


1  Introduction 

An  important  goal  of  SOI  is  the  identification  of  man-made  satellites.  Successful  identification  depends  on  knowledge 
of  the  surface  material  components  and  their  respective  boundaries.  One  source  of  this  information  is  ground-based 
images  of  the  target.  However,  do  to  image  blur  caused  by  both  the  telescope  and  the  turbulent  atmosphere,  these 
measurements  only  convey  partial  information  about  the  materials  and  boundaries. 

The  ability  of  an  electro-optical  (EO)  sensing  system  to  measure  and  extract  the  most  information  possible  from  a 
limited  set  of  measurements  of  the  target  is  a  crucial  design  requirement  for  future  space  surveillance  systems.  First, 
one  must  quantify  the  information  content  of  a  target  and  one  approach  is  to  introduce  a  model  of  the  target  in  terms 
of  shape  primitives  as  shown  in  Fig.  1.  The  target  can  then  be  expressed  as  a  vector  of  parameters,  X,  for  each  shape 
primitive  and  its  corresponding  material  with  a  probability  distribution  function  (PDF)  p(x).  The  differential  Shannon 
entropy  [1],  which  is  defined  as  the  following  self-average  computed  over  this  PDF, 

MX)  =  -J p(X)  In p(X)dx  =  — <lnPjf),  (1) 

is  a  measure  of  the  information-carrying  potential  of  the  random  variable  A'  and  represents  the  maximum  information 
that  can  be  encoded  into  the  variable  X.  The  more  ’’states”  that  are  accessible  to  the  random  variable  X  the  larger  its 
entropy  and  information  carrying  potential. 

When  modeled  as  a  communication  channel,  a  space  surveillance  system  processes  this  input  signal  yielding  a 
channel  output  Y.  The  amount  of  information  in  Y,  that  is  conveyed  about  channel  input  X  is  referred  to  as  the 
mutual  information  (MI)  between  X  and  Y  and  is  defined  as, 

J(X;Y)  =  H(X)-H(X\Y) 

=  H(Y)-  H(Y\X) 

=  H(X)+H(Y)-H(X,Y),  (2) 
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Figure  1 :  (Left)  Hubble  Space  Telescope  (Right)  Model  of  HST  using  shape  primitives  with  examples  of  local  boundaries 
that  accurately  model  the  material  boundaries  on  the  target 


where  the  conditional  entropies,  7T(X|Y)  and  H (Y|X),  and  the  joint  entropy  //  (X,  Y )  are  defined  formally  via 
relation  (1)  with  the  probability  distribution  Px  replaced  by  the  corresponding  conditional  or  joint  PDFs  inside  the 
logarithm.  The  equivalence  of  the  three  forms  in  Eq.(2)  is  easily  established  by  means  of  the  Bayes  theorem. 

2  Local  boundaries 

The  information  measures  in  Eq.  (1)  describe  the  global  information  and,  in  most  cases,  are  computational  prohibitive. 
Alternately,  one  can  compute  information  at  local  places  in  the  target  scene,  which  can  in  many  cases  be  possible 
especially  where  the  boundaries  can  be  expressed  in  terms  of  a  small  set  of  parameters.  This  idealization  is  likely  to  be 
accurate  locally  in  the  vicinity  of  a  boundary  between  two  materials  on  a  more  realistic  object  surface,  while  affording 
a  relatively  simple  information  theoretic  description  of  system  performance  as  shown  in  Fig  2. 
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Figure  2:  Model  of  a  local  boundary  that  consist  of  two  components,  a  circular  component  on  a  background 
surface.  The  circular  component  and  the  background  each  represent  a  surface  material.  The  boundary  of  the 
circular  component  is  described  by  the  radius  of  the  component. 

This  local  material  boundary  problem  at  wavelength  A  has  a  spatial  distribution, 

(x,  A)  =  (A )g(1’a)  (x)  +  i (A )g(2’a)  (x)  (3) 

where  q2  and  qw’a'>  are  the  profile  functions  for  the  two  material  components  that  have  a  common  boundary 
defined  by  a,  the  radius  of  the  circular  component.  The  quantities  i^’^(A)  and  'iS‘2P  (A)  represent  the  brightness  at 
the  individual  pixels  of  the  two  materials  (indexed  by  j  and  /:)  at  wavelength  A.  The  blurred  image  is  a  convolution  of 
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jO  Aa)(Xi  and  the  point-spread-function,  /iA(x),  of  the  telescope, 

gUAa)(x,  A)  =  ]T/^>“)(x,  A)/ia(x'-x) 

x' 

=  *(1>^(A)  ©  /ia)  (x)  +  i(2>/)(A)  (V2’“}  ©  ftA)  (x).  (4) 

We  compute  the  observed  image  in  a  spectral  band  (indexed  by  subscript  w)  of  width  AA  centered  at  wavelength 
An  by  integrating  over  the  wavelengths  in  the  band. 


/*A  o+AA/2 

9&*’a)(x)  =  /  W(A)5^(x,A)dA, 

J  Ao-AA/2 


(5) 


where  W(A)  denotes  the  transmittance  function  of  the  filter.  In  this  paper  we  will  take  l-k (A)  to  be  constant  over  the 
passband  of  the  filter. 

The  typical  compressive  sensing  (CS)  approach  [7]  requires  making  projective  measurements  using  K  spatial 
measurement  vectors  (masks)  defined  in  the  focal  plane  of  the  imaging  system.  The  type  of  mask  is  known  in  advance 
of  the  measurement  and  is  used  in  a  post  facto  image  reconstruction  of  the  original  scene.  Typically  such  masks  are 
taken  to  be  vector  of  components  randomly  and  uniformly  distributed,  but  here  we  use  masks  that  represent  specific 
Fourier  components  which  we  have  found  to  be  more  efficient  for  our  purposes.  We  denote  the  mask  functions  by 
rfc>u,(x),  k  =  1, ...,  K .  The  projective  measurements  are  simply  the  overlap  integrals  of  these  mask  functions  with  the 
image, 

=  J  9w’e’°‘\x)rk,w('x)dx- +  nk,  (6) 

where  the  additive  noise  for  the  kth  measurement  is  assumed  to  be  distributed  according  to  a  Gaussian  PDF  with 
zero  mean  and  variance  a2N. 

The  essential  metric  of  performance  in  the  statistical  IT  description  [1]  we  now  present  is  the  ratio  of  the  mutual 
information,  which  is  the  information  successfully  passed  by  the  measurement  system,  to  the  entropy,  which  represents 
the  maximum  information  presented  to  the  system.  This  ratio  can  be  calculated  for  each  parameter  of  interest  as  a  way 
of  assessing  the  overall  system  performance.  We  demonstrate  our  IT  formalism  first  for  the  special  case  where  the 
boundary  location  a  is  known  a  priori.  In  this  simplified  problem,  the  only  variable  of  interest  is  the  flux  vector, 
I  =  (*(1>f)j  j(24));  whose  two  components  denote  the  flux  density  of  materials  1  and  2,  respectively.  We  shall  take  the 
statistical  distribution  of  material  pairs  to  follow  a  uniform  PDF, 


P(  I) 


N, 


pairs 


(7) 


where  the  total  pair  combinations  of  materials  on  the  target  is  Npairs  =  Nrn(Nrn  —  l)/2  where  Nm  is  the  total  number 
of  material  pairs  in  the  database  (of  spectral  signatures)  of  known  materials.  The  PDF  for  the  additive  noise  vector,  n, 
is  of  the  Gaussian  form 

P(n)  =  7 O - 2Tf©2  eXP  [-nTn/(2oir)]  .  (8) 

(znaN) 

where  the  subscript  T  on  a  vector  or  matrix  represents  its  transpose  and  a  matrix-product  operation  is  understood 
whenever  two  vectors  or  matrices  occur  next  to  each  other. 

For  a  discrete-valued  random  variable,  the  above  self-average  must  be  understood  as  being  calculated  over  the 
discrete  probability  distribution  Px{%i)  via  Eq.  (l).When  the  discrete  probability  distribution  is  derived  from  a  con¬ 
tinuous  PDF  by  integrating  the  latter  over  successive  discreteness  intervals,  or  histogram  bins,  each  of  width  Ax  that 
is  very  small  compared  to  the  scale  over  which  the  PDF  varies,  then,  since  Px{xi)  ~  Px(xi) Ax,  we  may  relate  the 
differential  and  discrete  entropies  as 

H(X)  «  h{X)  -  lnAx.  (9) 

This  relation  breaks  down  when  the  discreteness  interval  Ax,  which  we  may  also  call  the  precision  to  which  the 
continuous  variable  X  is  ’’measured,”  or  simply  the  X-precision,  is  no  longer  small  compared  to  the  width  of  the  PDF. 
In  this  paper  we  shall  assume  that  the  relation  (9)  is  always  valid  and  therefore  we  can  utilize  it  as  a  way  of  computing 
the  discrete  entropy  from  its  corresponding  differential  entropy. 
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Since  MI  is  a  difference  of  two  entropies  for  the  same  variable,  it  is  well  approximated  according  to  relation  (9) 
by  the  difference  of  the  corresponding  differential  entropies  from  which  the  In  SX  terms  drop  out.  As  we  indicated 
earlier,  the  ratio,  1Z  =  I(X;Y) / H (X),  may  be  regarded  as  a  measure  of  the  fidelity  with  which  information  about  X 
can  be  extracted  from  the  measurement  Y. 

We  first  consider  a  sequential  set  of  measurements  performed  in  a  single  wavelength  band  w.  The  conditional  PDF 
for  the  measurement  vector  m,  given  I  and  a,  is  seen  to  be  simply  the  /t'-tlimensional  hyperspherical  Gaussian  PDF 
of  the  IID  noise  vector. 


Pwi  m|I,a) 


1 


K 

n  exp 

k= 1 


(2,c 

111  °W  t'w  °W  ir'w 


II2  /(2 <r%) 


(10) 


where  the  subscript  w  denotes  the  wave  band  and  a  pair  of  double  vertical  bars  enclosing  a  vector  v  denotes  its 
Euclidean  norm,  ||  v  ||=  V vTv.  Because  of  its  product  form  of  the  Gaussian  PDF,  its  differential  entropy  is  simply 

/i(m|I,  a)  =  (K/2)  In  (^necr^)  .  (11) 

We  assume  a  uniform  distribution  on  the  boundary  parameters  and  then  use  Bayes  rule  to  compute  the  conditional 
PDFs,  pw( m|a)  and  p,„(m|I),  and  then  via  Eq.  (2)  compute  the  mutual  information  in  the  measurements  about  the 
surface  materials  of  the  two  components  and  the  parameters  that  describe  the  common  boundary. 

The  measurement  PDF  in  Eq.  (10)  is  for  a  single  waveband.  We  compute  the  measurement  PDF  for  spatially 
compressive  sensing  system,  operating  at  multiple  wavelength  bands  in  the  region  of  the  spectrum  starting  at  0.4pm 
and  ending  at  2.5 pm.,  as  the  IE-fold  product  of  Eq.  (10), 


w 

p(m|I,  a)  =  JJpw(m|I,a). 

W—  1 


(12) 


As  in  the  case  of  a  single  band  measurement,  we  compute  both,  p(m|a)  and  pw(m|I),  and  the  compute  the  mutual 
information  in  the  measurements  performed  in  each  wavelength  band. 

Computation  of  the  statistical  entropies  for  Eq.  (10)  and  Eq.  (12)  relies  on  an  implementation  of  a  Markov 
Chain  Monte  Carlo  (MCMC)  algorithm  that  obtains  a  sequence  of  random  samples  from  the  probability  distribution 
of  interest.  This  sequence  is  used  to  approximate  the  distribution  and  compute  the  statistical  entropies.  The  outline  of 
our  approach  follows.  We  start  with  a  random  measurement  value  m  and  draw  the  next  measurement,  m*,  based  on  a 
proposal  distribution  q  centered  on  m.  We  evaluate  the  ratio. 


/  p(m*)g(m*,mt_1)  \ 

\p(mt_i)g(mt_i,  m*)  ’  ) 


(13) 


and  accept  the  proposed  measurement  value  with  probability  7.  This  implementation  is  referred  to  as  the  Metropolis- 
Hustings  algorithm[3,  4]  and  it  enables  us  to  compute  statistical  entropies  efficiently  in  high  dimensions. 

Important  factors  that  will  reduce  the  amount  of  information  in  the  compressive  measurements  include  the  amount 
of  blur,  the  noise  and  the  choice  of  measurement  mask.  The  image  blur  in  Eq.  (4)  acts  as  a  spatial  filter  that  re¬ 
duces  information  about  the  scene,  particular  at  high  spatial  frequencies.  The  amount  of  filtering  is  greater  at  longer 
wavelengths  than  at  shorter  wavelengths.  Reducing  the  effects  of  noise  on  the  measurement  can  be  accomplished  by 
integrating  over  many  wavelength  channels;  however,  due  to  the  wavelength  dependence  of  the  blur  the  integrated 
image  becomes  a  sum  of  images  with  differing  amounts  of  blur.  The  ability  of  the  measurement  mask  to  capture  infor¬ 
mation  about  the  parameters  of  interest  in  the  presence  of  noise  and  this  integrated  blur  can  be  evaluated  by  computing 
the  mutual  information  for  different  choices  of  masks. 


3  Results 

We  demonstrate  this  analysis  on  the  problem  in  Fig.  2  for  a  spatially  compressive  sensing  system  operating  in  multiple 
wavelength  bands.  In  each  wavelength  band  a  sequential  set  of  measurements  is  performed  using  the  DC  mask,  the 
sine/cosine  for  the  lowest  spatial  frequency  and  the  sine/cosine  for  the  next  lowest  spatial  frequency  for  a  total  of  5 
measurements.  The  masks  for  the  first  three  measurements  are  shown  in  Fig.  3.  Our  results  are  shown  in  Fig.  4. 
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Figure  3:  Plotted  are  the  DC  (zero  Fourier  frequency)  and  the  cosine  and  sine  components  of  the  lowest  spatial 
frequency 


Here  the  blue  curve  (square  symbols)  represents  the  MI  recovered  using  a  single  DC  measurement  mask  at  a  different 
number  of  wavebands.  In  this  scenario  all  the  information  about  the  materials  comes  from  simply  measuring  the  total 
flux  in  each  band.  Clearly,  in  the  limit  of  no  noise  such  an  approach  would  recover  all  the  information  about  the 
materials,  which  in  our  case  is  log2  (NparrH)=  5.81  bits  (here  we  have  expressed  the  recovered  MI  as  a  fraction  of 
the  entropy  H (I)).  As  more  wavebands  used  the  MI  recovered  increases  due  to  the  wavelength-dependent  reflectance 
properties  of  the  materials,  as  they  are  more  easily  identified.  However,  beyond  4  wavebands  the  MI  recovered  begins 
to  decrease  due  to  the  splitting  of  photons  amongst  the  multiple  wave-bands.  Here,  our  information-based  analysis 
shows  the  trade-off  point  when  using  a  mutli-wavelength  compressive  sensing  approach. 

In  addition  to  the  DC  measurement  we  now  included  spatial  measurements  using  the  cosine  and  sine  components  of 
the  lowest  spatial  frequency,  i.e.  we  perform  3  measurements.  One  can  see  the  added  information  by  including  a  spatial 
measurement  with  the  DC  measurement.  Like  the  DC  measurement  alone,  we  also  see  a  decrease  in  information  as 
more  wave  bands  are  introduced;  however  this  decrease  is  mitigated  by  the  additional  measurements.  We  next  include 
the  second  lowest  spatial  frequency  for  at  total  5  measurements.  One  can  now  quantify  the  additional  information 
provided  by  this  measurement,  and  see  further  mitigation  of  the  information  loss  at  10  wavebands. 

3.1  Conclusions 

We  have  analyzed  the  problem  of  SOI  from  spatially  compressive  sensing  measurements  from  an  information  theoretic 
viewpoint .  We  have  shown  that  by  computing  MI  one  can  assess  the  additional  information  acquired  by  using  multiple 
wave  bands.  It  was  shown  that  increasing  the  number  of  wavebands  results  in  an  increase  in  MI,  which  is  due  to  the 
wavelength-dependent  reflectance  properties  of  the  materials.  As  more  wavebands  are  introduced  however,  the  MI 
begins  to  decrease  due  to  the  splitting  of  photons  amongst  the  wavebands.  The  MI  allows  one  to  quantify  this  trade-off 
between  SNR  and  enhanced  material  discrimination. 

The  above  results  rely  heavily  on  the  Metropolis-Hastings  algorithm,  while  the  result  shown  in  Fig.  4  for  five  mea¬ 
surements  in  10  wave  bands  depends  on  computing  the  statistical  entropies,  i.e.  Eq.  (1),  in  a  50-dimensional  parameter 
space.  As  the  complexity  of  our  problem  increases,  we  will  begin  to  encounter  measurement  probability  functions  that 
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Figure  4:  The  MI  recovered  about  boundary  parameters  a  and  J  using  a  spatially  compressive  sensing  system 
operating  at  multiple  wavelength  bands 


can,  depending  on  our  local  boundary  parameterization  and  the  reflectance  properties  of  the  materials  chosen  from  our 
database,  have  multiple  modes.  An  example  is  shown  in  Fig.  5.  Our  next  steps  will  be  the  implementation  of  a  Normal 
Kernel  Coupler  [5]  to  address  entropy  computations  for  similar  cases.  Eventually,  our  simulations  will  be  extended  to 
include  the  case  of  both  a  spatial  and  spectral  compressive  system  [6]  for  more  comprehensive  results. 
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Figure  5:  An  example  of  a  2-dimensional  measurement  probability  function  exhibiting  multiple  modes. 
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